Improved speech recognition word lattice translation by confidence measure

نویسندگان

  • Abdulvohid Bozarov
  • Yoshinori Sagisaka
  • Ruiqiang Zhang
  • Gen-ichiro Kikui
چکیده

In conventional speech translation systems, Automatic Speech Recognition (ASR) produces a single hypothesis which is then translated by the SMT system. The translation results of SMT system are impaired by the word errors of the first best hypothesis in this approach more or less. To improve speech translation, we use a new word lattice translation approach which integrates multiple information sources from the speech recognition word lattice to discount the misrecognition. Furthermore, in order to improve speech translation and to reduce computation, we used N-bests cutoff, merging of identical word ids, and confidence measure. Experiments of Japanese-to-English speech translation showed that the proposed word lattice translation outperforms the conventional single best method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A decoding algorithm for word lattice translation in speech translation

We propose a novel statistical machine translation decoding algorithm for speech translation to improve speech translation quality. The algorithm can translate the speech recognition word lattice, where more hypotheses are utilized to bypass the misrecognized single-best hypotheses. We also show that a speech recognition confidence measure, implemented by posterior probability, is effective to ...

متن کامل

Integration of speech recognition and machine translation: Speech recognition word lattice translation

An important issue in speech translation is to minimize the negative effect of speech recognition errors on machine translation. We propose a novel statistical machine translation decoding algorithm for speech translation to improve speech translation quality. The algorithm can translate the speech recognition word lattice, where more hypotheses are utilized to bypass the misrecognized single-b...

متن کامل

Improving Speech-to-Speech Translation Using Word Posterior Probabilities

Nowadays, speech translation is a research problem in machine translation. The problem arises as to how to combine speech recognition and machine translation in a suitable way. Some authors have shown that the speech translation can be improved by using word lattices as input of the translation system. The acoustic recognition scores from the word lattice are used for improving the translation ...

متن کامل

Exploiting deep neural networks for detection-based speech recognition

In recent years deep neural networks (DNNs) – multilayer perceptrons (MLPs) with many hidden layers – have been successfully applied to several speech tasks, i.e., phoneme recognition, out of vocabulary word detection, confidence measure, etc. In this paper, we show that DNNs can be used to boost the classification accuracy of basic speech units, such as phonetic attributes (phonological featur...

متن کامل

Using Word Lattice Information for a Tighter Coupling in Speech Translation Systems

In this paper we present first experiments towards a tighter coupling between Automatic Speech Recognition (ASR) and Statistical Machine Translation (SMT) to improve the overall performance of our speech translation system. In coventional speech translation systems, the recognizer outputs a single hypothesis which is then translated by the SMT system. This approach has the limitation of being l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005